Overview

Dataset statistics

Number of variables35
Number of observations1470
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory402.1 KiB
Average record size in memory280.1 B

Variable types

Numeric15
Boolean3
Categorical17

Warnings

EmployeeCount has constant value "1" Constant
Over18 has constant value "True" Constant
StandardHours has constant value "80" Constant
JobLevel is highly correlated with MonthlyIncomeHigh correlation
MonthlyIncome is highly correlated with JobLevelHigh correlation
WorkLifeBalance is highly correlated with StandardHours and 2 other fieldsHigh correlation
StandardHours is highly correlated with WorkLifeBalance and 18 other fieldsHigh correlation
JobRole is highly correlated with StandardHours and 3 other fieldsHigh correlation
JobInvolvement is highly correlated with StandardHours and 2 other fieldsHigh correlation
OverTime is highly correlated with StandardHours and 2 other fieldsHigh correlation
StockOptionLevel is highly correlated with StandardHours and 2 other fieldsHigh correlation
RelationshipSatisfaction is highly correlated with StandardHours and 2 other fieldsHigh correlation
Department is highly correlated with StandardHours and 3 other fieldsHigh correlation
JobLevel is highly correlated with StandardHours and 2 other fieldsHigh correlation
Gender is highly correlated with StandardHours and 2 other fieldsHigh correlation
BusinessTravel is highly correlated with StandardHours and 2 other fieldsHigh correlation
Over18 is highly correlated with WorkLifeBalance and 18 other fieldsHigh correlation
MaritalStatus is highly correlated with StandardHours and 2 other fieldsHigh correlation
JobSatisfaction is highly correlated with StandardHours and 2 other fieldsHigh correlation
EmployeeCount is highly correlated with WorkLifeBalance and 18 other fieldsHigh correlation
Education is highly correlated with StandardHours and 2 other fieldsHigh correlation
Attrition is highly correlated with StandardHours and 2 other fieldsHigh correlation
EducationField is highly correlated with StandardHours and 2 other fieldsHigh correlation
EnvironmentSatisfaction is highly correlated with StandardHours and 2 other fieldsHigh correlation
PerformanceRating is highly correlated with StandardHours and 2 other fieldsHigh correlation
EmployeeNumber has unique values Unique
NumCompaniesWorked has 197 (13.4%) zeros Zeros
TrainingTimesLastYear has 54 (3.7%) zeros Zeros
YearsAtCompany has 44 (3.0%) zeros Zeros
YearsInCurrentRole has 244 (16.6%) zeros Zeros
YearsSinceLastPromotion has 581 (39.5%) zeros Zeros
YearsWithCurrManager has 263 (17.9%) zeros Zeros

Reproduction

Analysis started2021-03-17 22:19:38.776843
Analysis finished2021-03-17 22:20:15.040288
Duration36.26 seconds
Software versionpandas-profiling v2.12.0
Download configurationconfig.yaml

Variables

Age
Real number (ℝ≥0)

Distinct43
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.92380952
Minimum18
Maximum60
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:15.184521image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum18
5-th percentile24
Q130
median36
Q343
95-th percentile54
Maximum60
Range42
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.135373489
Coefficient of variation (CV)0.2474114564
Kurtosis-0.4041451372
Mean36.92380952
Median Absolute Deviation (MAD)6
Skewness0.4132863019
Sum54278
Variance83.45504879
MonotocityNot monotonic
2021-03-17T19:20:15.340471image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
3578
 
5.3%
3477
 
5.2%
3169
 
4.7%
3669
 
4.7%
2968
 
4.6%
3261
 
4.1%
3060
 
4.1%
3358
 
3.9%
3858
 
3.9%
4057
 
3.9%
Other values (33)815
55.4%
ValueCountFrequency (%)
188
0.5%
199
0.6%
2011
0.7%
2113
0.9%
2216
1.1%
ValueCountFrequency (%)
605
 
0.3%
5910
0.7%
5814
1.0%
574
 
0.3%
5614
1.0%

Attrition
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
False
1233 
True
237 
ValueCountFrequency (%)
False1233
83.9%
True237
 
16.1%
2021-03-17T19:20:15.447035image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

BusinessTravel
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
Travel_Rarely
1043 
Travel_Frequently
277 
Non-Travel
150 

Length

Max length17
Median length13
Mean length13.44761905
Min length10

Characters and Unicode

Total characters19768
Distinct characters17
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTravel_Rarely
2nd rowTravel_Frequently
3rd rowTravel_Rarely
4th rowTravel_Frequently
5th rowTravel_Rarely
ValueCountFrequency (%)
Travel_Rarely1043
71.0%
Travel_Frequently277
 
18.8%
Non-Travel150
 
10.2%
2021-03-17T19:20:15.641731image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:15.728322image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
travel_rarely1043
71.0%
travel_frequently277
 
18.8%
non-travel150
 
10.2%

Most occurring characters

ValueCountFrequency (%)
e3067
15.5%
r2790
14.1%
l2790
14.1%
a2513
12.7%
T1470
7.4%
v1470
7.4%
_1320
6.7%
y1320
6.7%
R1043
 
5.3%
n427
 
2.2%
Other values (7)1558
7.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter15358
77.7%
Uppercase Letter2940
 
14.9%
Connector Punctuation1320
 
6.7%
Dash Punctuation150
 
0.8%

Most frequent character per category

ValueCountFrequency (%)
e3067
20.0%
r2790
18.2%
l2790
18.2%
a2513
16.4%
v1470
9.6%
y1320
8.6%
n427
 
2.8%
q277
 
1.8%
u277
 
1.8%
t277
 
1.8%
ValueCountFrequency (%)
T1470
50.0%
R1043
35.5%
F277
 
9.4%
N150
 
5.1%
ValueCountFrequency (%)
_1320
100.0%
ValueCountFrequency (%)
-150
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin18298
92.6%
Common1470
 
7.4%

Most frequent character per script

ValueCountFrequency (%)
e3067
16.8%
r2790
15.2%
l2790
15.2%
a2513
13.7%
T1470
8.0%
v1470
8.0%
y1320
7.2%
R1043
 
5.7%
n427
 
2.3%
F277
 
1.5%
Other values (5)1131
 
6.2%
ValueCountFrequency (%)
_1320
89.8%
-150
 
10.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII19768
100.0%

Most frequent character per block

ValueCountFrequency (%)
e3067
15.5%
r2790
14.1%
l2790
14.1%
a2513
12.7%
T1470
7.4%
v1470
7.4%
_1320
6.7%
y1320
6.7%
R1043
 
5.3%
n427
 
2.2%
Other values (7)1558
7.9%

DailyRate
Real number (ℝ≥0)

Distinct886
Distinct (%)60.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean802.4857143
Minimum102
Maximum1499
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:15.840018image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum102
5-th percentile165.35
Q1465
median802
Q31157
95-th percentile1424.1
Maximum1499
Range1397
Interquartile range (IQR)692

Descriptive statistics

Standard deviation403.5090999
Coefficient of variation (CV)0.5028240288
Kurtosis-1.203822808
Mean802.4857143
Median Absolute Deviation (MAD)344
Skewness-0.003518568352
Sum1179654
Variance162819.5937
MonotocityNot monotonic
2021-03-17T19:20:15.971744image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6916
 
0.4%
10825
 
0.3%
3295
 
0.3%
13295
 
0.3%
5305
 
0.3%
4085
 
0.3%
7154
 
0.3%
5894
 
0.3%
9064
 
0.3%
3504
 
0.3%
Other values (876)1423
96.8%
ValueCountFrequency (%)
1021
0.1%
1031
0.1%
1041
0.1%
1051
0.1%
1061
0.1%
ValueCountFrequency (%)
14991
 
0.1%
14981
 
0.1%
14962
0.1%
14953
0.2%
14921
 
0.1%

Department
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
Research & Development
961 
Sales
446 
Human Resources
 
63

Length

Max length22
Median length22
Mean length16.54217687
Min length5

Characters and Unicode

Total characters24317
Distinct characters20
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSales
2nd rowResearch & Development
3rd rowResearch & Development
4th rowResearch & Development
5th rowResearch & Development
ValueCountFrequency (%)
Research & Development961
65.4%
Sales446
30.3%
Human Resources63
 
4.3%
2021-03-17T19:20:16.232345image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:16.327408image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
development961
27.8%
research961
27.8%
961
27.8%
sales446
12.9%
human63
 
1.8%
resources63
 
1.8%

Most occurring characters

ValueCountFrequency (%)
e5377
22.1%
1985
 
8.2%
s1533
 
6.3%
a1470
 
6.0%
l1407
 
5.8%
R1024
 
4.2%
r1024
 
4.2%
c1024
 
4.2%
o1024
 
4.2%
m1024
 
4.2%
Other values (10)7425
30.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter18877
77.6%
Uppercase Letter2494
 
10.3%
Space Separator1985
 
8.2%
Other Punctuation961
 
4.0%

Most frequent character per category

ValueCountFrequency (%)
e5377
28.5%
s1533
 
8.1%
a1470
 
7.8%
l1407
 
7.5%
r1024
 
5.4%
c1024
 
5.4%
o1024
 
5.4%
m1024
 
5.4%
n1024
 
5.4%
h961
 
5.1%
Other values (4)3009
15.9%
ValueCountFrequency (%)
R1024
41.1%
D961
38.5%
S446
17.9%
H63
 
2.5%
ValueCountFrequency (%)
1985
100.0%
ValueCountFrequency (%)
&961
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin21371
87.9%
Common2946
 
12.1%

Most frequent character per script

ValueCountFrequency (%)
e5377
25.2%
s1533
 
7.2%
a1470
 
6.9%
l1407
 
6.6%
R1024
 
4.8%
r1024
 
4.8%
c1024
 
4.8%
o1024
 
4.8%
m1024
 
4.8%
n1024
 
4.8%
Other values (8)5440
25.5%
ValueCountFrequency (%)
1985
67.4%
&961
32.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII24317
100.0%

Most frequent character per block

ValueCountFrequency (%)
e5377
22.1%
1985
 
8.2%
s1533
 
6.3%
a1470
 
6.0%
l1407
 
5.8%
R1024
 
4.2%
r1024
 
4.2%
c1024
 
4.2%
o1024
 
4.2%
m1024
 
4.2%
Other values (10)7425
30.5%

DistanceFromHome
Real number (ℝ≥0)

Distinct29
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.192517007
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:16.411515image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median7
Q314
95-th percentile26
Maximum29
Range28
Interquartile range (IQR)12

Descriptive statistics

Standard deviation8.106864436
Coefficient of variation (CV)0.8818982254
Kurtosis-0.2248334049
Mean9.192517007
Median Absolute Deviation (MAD)5
Skewness0.9581179957
Sum13513
Variance65.72125098
MonotocityNot monotonic
2021-03-17T19:20:16.525891image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
2211
14.4%
1208
14.1%
1086
 
5.9%
985
 
5.8%
384
 
5.7%
784
 
5.7%
880
 
5.4%
565
 
4.4%
464
 
4.4%
659
 
4.0%
Other values (19)444
30.2%
ValueCountFrequency (%)
1208
14.1%
2211
14.4%
384
 
5.7%
464
 
4.4%
565
 
4.4%
ValueCountFrequency (%)
2927
1.8%
2823
1.6%
2712
0.8%
2625
1.7%
2525
1.7%

Education
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
3
572 
4
398 
2
282 
1
170 
5
 
48

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row1
3rd row2
4th row4
5th row1
ValueCountFrequency (%)
3572
38.9%
4398
27.1%
2282
19.2%
1170
 
11.6%
548
 
3.3%
2021-03-17T19:20:16.785590image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:16.861164image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
3572
38.9%
4398
27.1%
2282
19.2%
1170
 
11.6%
548
 
3.3%

Most occurring characters

ValueCountFrequency (%)
3572
38.9%
4398
27.1%
2282
19.2%
1170
 
11.6%
548
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
3572
38.9%
4398
27.1%
2282
19.2%
1170
 
11.6%
548
 
3.3%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
3572
38.9%
4398
27.1%
2282
19.2%
1170
 
11.6%
548
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
3572
38.9%
4398
27.1%
2282
19.2%
1170
 
11.6%
548
 
3.3%

EducationField
Categorical

HIGH CORRELATION

Distinct6
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
Life Sciences
606 
Medical
464 
Marketing
159 
Technical Degree
132 
Other
82 

Length

Max length16
Median length13
Mean length10.53333333
Min length5

Characters and Unicode

Total characters15484
Distinct characters26
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLife Sciences
2nd rowLife Sciences
3rd rowOther
4th rowLife Sciences
5th rowMedical
ValueCountFrequency (%)
Life Sciences606
41.2%
Medical464
31.6%
Marketing159
 
10.8%
Technical Degree132
 
9.0%
Other82
 
5.6%
Human Resources27
 
1.8%
2021-03-17T19:20:17.073089image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:17.151030image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
life606
27.1%
sciences606
27.1%
medical464
20.8%
marketing159
 
7.1%
technical132
 
5.9%
degree132
 
5.9%
other82
 
3.7%
human27
 
1.2%
resources27
 
1.2%

Most occurring characters

ValueCountFrequency (%)
e3105
20.1%
i1967
12.7%
c1967
12.7%
n924
 
6.0%
a782
 
5.1%
765
 
4.9%
s660
 
4.3%
M623
 
4.0%
L606
 
3.9%
f606
 
3.9%
Other values (16)3479
22.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter12484
80.6%
Uppercase Letter2235
 
14.4%
Space Separator765
 
4.9%

Most frequent character per category

ValueCountFrequency (%)
e3105
24.9%
i1967
15.8%
c1967
15.8%
n924
 
7.4%
a782
 
6.3%
s660
 
5.3%
f606
 
4.9%
l596
 
4.8%
d464
 
3.7%
r400
 
3.2%
Other values (7)1013
 
8.1%
ValueCountFrequency (%)
M623
27.9%
L606
27.1%
S606
27.1%
T132
 
5.9%
D132
 
5.9%
O82
 
3.7%
H27
 
1.2%
R27
 
1.2%
ValueCountFrequency (%)
765
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin14719
95.1%
Common765
 
4.9%

Most frequent character per script

ValueCountFrequency (%)
e3105
21.1%
i1967
13.4%
c1967
13.4%
n924
 
6.3%
a782
 
5.3%
s660
 
4.5%
M623
 
4.2%
L606
 
4.1%
f606
 
4.1%
S606
 
4.1%
Other values (15)2873
19.5%
ValueCountFrequency (%)
765
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII15484
100.0%

Most frequent character per block

ValueCountFrequency (%)
e3105
20.1%
i1967
12.7%
c1967
12.7%
n924
 
6.0%
a782
 
5.1%
765
 
4.9%
s660
 
4.3%
M623
 
4.0%
L606
 
3.9%
f606
 
3.9%
Other values (16)3479
22.5%

EmployeeCount
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
1
1470 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
11470
100.0%
2021-03-17T19:20:17.367329image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:17.435170image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
11470
100.0%

Most occurring characters

ValueCountFrequency (%)
11470
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
11470
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
11470
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
11470
100.0%

EmployeeNumber
Real number (ℝ≥0)

UNIQUE

Distinct1470
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1024.865306
Minimum1
Maximum2068
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:17.868534image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile96.45
Q1491.25
median1020.5
Q31555.75
95-th percentile1967.55
Maximum2068
Range2067
Interquartile range (IQR)1064.5

Descriptive statistics

Standard deviation602.0243348
Coefficient of variation (CV)0.5874180063
Kurtosis-1.223178906
Mean1024.865306
Median Absolute Deviation (MAD)533.5
Skewness0.01657401958
Sum1506552
Variance362433.2997
MonotocityStrictly increasing
2021-03-17T19:20:18.069348image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20461
 
0.1%
6411
 
0.1%
6441
 
0.1%
6451
 
0.1%
6471
 
0.1%
6481
 
0.1%
6491
 
0.1%
6501
 
0.1%
6521
 
0.1%
6531
 
0.1%
Other values (1460)1460
99.3%
ValueCountFrequency (%)
11
0.1%
21
0.1%
41
0.1%
51
0.1%
71
0.1%
ValueCountFrequency (%)
20681
0.1%
20651
0.1%
20641
0.1%
20621
0.1%
20611
0.1%

EnvironmentSatisfaction
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
3
453 
4
446 
2
287 
1
284 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row3
3rd row4
4th row4
5th row1
ValueCountFrequency (%)
3453
30.8%
4446
30.3%
2287
19.5%
1284
19.3%
2021-03-17T19:20:18.382575image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:18.491741image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
3453
30.8%
4446
30.3%
2287
19.5%
1284
19.3%

Most occurring characters

ValueCountFrequency (%)
3453
30.8%
4446
30.3%
2287
19.5%
1284
19.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
3453
30.8%
4446
30.3%
2287
19.5%
1284
19.3%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
3453
30.8%
4446
30.3%
2287
19.5%
1284
19.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
3453
30.8%
4446
30.3%
2287
19.5%
1284
19.3%

Gender
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
Male
882 
Female
588 

Length

Max length6
Median length4
Mean length4.8
Min length4

Characters and Unicode

Total characters7056
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFemale
2nd rowMale
3rd rowMale
4th rowFemale
5th rowMale
ValueCountFrequency (%)
Male882
60.0%
Female588
40.0%
2021-03-17T19:20:18.738469image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:18.820491image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
male882
60.0%
female588
40.0%

Most occurring characters

ValueCountFrequency (%)
e2058
29.2%
a1470
20.8%
l1470
20.8%
M882
12.5%
F588
 
8.3%
m588
 
8.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter5586
79.2%
Uppercase Letter1470
 
20.8%

Most frequent character per category

ValueCountFrequency (%)
e2058
36.8%
a1470
26.3%
l1470
26.3%
m588
 
10.5%
ValueCountFrequency (%)
M882
60.0%
F588
40.0%

Most occurring scripts

ValueCountFrequency (%)
Latin7056
100.0%

Most frequent character per script

ValueCountFrequency (%)
e2058
29.2%
a1470
20.8%
l1470
20.8%
M882
12.5%
F588
 
8.3%
m588
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII7056
100.0%

Most frequent character per block

ValueCountFrequency (%)
e2058
29.2%
a1470
20.8%
l1470
20.8%
M882
12.5%
F588
 
8.3%
m588
 
8.3%

HourlyRate
Real number (ℝ≥0)

Distinct71
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65.89115646
Minimum30
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:18.919967image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile33
Q148
median66
Q383.75
95-th percentile97
Maximum100
Range70
Interquartile range (IQR)35.75

Descriptive statistics

Standard deviation20.32942759
Coefficient of variation (CV)0.3085304415
Kurtosis-1.196398456
Mean65.89115646
Median Absolute Deviation (MAD)18
Skewness-0.0323109529
Sum96860
Variance413.2856263
MonotocityNot monotonic
2021-03-17T19:20:19.053713image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6629
 
2.0%
4228
 
1.9%
9828
 
1.9%
4828
 
1.9%
8428
 
1.9%
7927
 
1.8%
9627
 
1.8%
5727
 
1.8%
5226
 
1.8%
8726
 
1.8%
Other values (61)1196
81.4%
ValueCountFrequency (%)
3019
1.3%
3115
1.0%
3224
1.6%
3319
1.3%
3412
0.8%
ValueCountFrequency (%)
10019
1.3%
9920
1.4%
9828
1.9%
9721
1.4%
9627
1.8%

JobInvolvement
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
3
868 
2
375 
4
144 
1
 
83

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row2
3rd row2
4th row3
5th row3
ValueCountFrequency (%)
3868
59.0%
2375
25.5%
4144
 
9.8%
183
 
5.6%
2021-03-17T19:20:19.311140image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:19.387829image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
3868
59.0%
2375
25.5%
4144
 
9.8%
183
 
5.6%

Most occurring characters

ValueCountFrequency (%)
3868
59.0%
2375
25.5%
4144
 
9.8%
183
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
3868
59.0%
2375
25.5%
4144
 
9.8%
183
 
5.6%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
3868
59.0%
2375
25.5%
4144
 
9.8%
183
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
3868
59.0%
2375
25.5%
4144
 
9.8%
183
 
5.6%

JobLevel
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
1
543 
2
534 
3
218 
4
106 
5
69 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
1543
36.9%
2534
36.3%
3218
14.8%
4106
 
7.2%
569
 
4.7%
2021-03-17T19:20:19.641401image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:19.745711image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
1543
36.9%
2534
36.3%
3218
14.8%
4106
 
7.2%
569
 
4.7%

Most occurring characters

ValueCountFrequency (%)
1543
36.9%
2534
36.3%
3218
14.8%
4106
 
7.2%
569
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
1543
36.9%
2534
36.3%
3218
14.8%
4106
 
7.2%
569
 
4.7%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
1543
36.9%
2534
36.3%
3218
14.8%
4106
 
7.2%
569
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
1543
36.9%
2534
36.3%
3218
14.8%
4106
 
7.2%
569
 
4.7%

JobRole
Categorical

HIGH CORRELATION

Distinct9
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
Sales Executive
326 
Research Scientist
292 
Laboratory Technician
259 
Manufacturing Director
145 
Healthcare Representative
131 
Other values (4)
317 

Length

Max length25
Median length18
Mean length18.0707483
Min length7

Characters and Unicode

Total characters26564
Distinct characters29
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSales Executive
2nd rowResearch Scientist
3rd rowLaboratory Technician
4th rowResearch Scientist
5th rowLaboratory Technician
ValueCountFrequency (%)
Sales Executive326
22.2%
Research Scientist292
19.9%
Laboratory Technician259
17.6%
Manufacturing Director145
9.9%
Healthcare Representative131
8.9%
Manager102
 
6.9%
Sales Representative83
 
5.6%
Research Director80
 
5.4%
Human Resources52
 
3.5%
2021-03-17T19:20:20.020604image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:20.121107image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
sales409
14.4%
research372
13.1%
executive326
11.5%
scientist292
10.3%
laboratory259
9.1%
technician259
9.1%
director225
7.9%
representative214
7.5%
manufacturing145
 
5.1%
healthcare131
 
4.6%
Other values (3)206
7.3%

Most occurring characters

ValueCountFrequency (%)
e3905
14.7%
a2580
 
9.7%
t2098
 
7.9%
c2061
 
7.8%
i2012
 
7.6%
r1984
 
7.5%
n1468
 
5.5%
s1391
 
5.2%
1368
 
5.1%
o795
 
3.0%
Other values (19)6902
26.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter22358
84.2%
Uppercase Letter2838
 
10.7%
Space Separator1368
 
5.1%

Most frequent character per category

ValueCountFrequency (%)
e3905
17.5%
a2580
11.5%
t2098
9.4%
c2061
9.2%
i2012
9.0%
r1984
8.9%
n1468
 
6.6%
s1391
 
6.2%
o795
 
3.6%
h762
 
3.4%
Other values (10)3302
14.8%
ValueCountFrequency (%)
S701
24.7%
R638
22.5%
E326
11.5%
L259
 
9.1%
T259
 
9.1%
M247
 
8.7%
D225
 
7.9%
H183
 
6.4%
ValueCountFrequency (%)
1368
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin25196
94.9%
Common1368
 
5.1%

Most frequent character per script

ValueCountFrequency (%)
e3905
15.5%
a2580
10.2%
t2098
 
8.3%
c2061
 
8.2%
i2012
 
8.0%
r1984
 
7.9%
n1468
 
5.8%
s1391
 
5.5%
o795
 
3.2%
h762
 
3.0%
Other values (18)6140
24.4%
ValueCountFrequency (%)
1368
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII26564
100.0%

Most frequent character per block

ValueCountFrequency (%)
e3905
14.7%
a2580
 
9.7%
t2098
 
7.9%
c2061
 
7.8%
i2012
 
7.6%
r1984
 
7.5%
n1468
 
5.5%
s1391
 
5.2%
1368
 
5.1%
o795
 
3.0%
Other values (19)6902
26.0%

JobSatisfaction
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
4
459 
3
442 
1
289 
2
280 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row2
3rd row3
4th row3
5th row2
ValueCountFrequency (%)
4459
31.2%
3442
30.1%
1289
19.7%
2280
19.0%
2021-03-17T19:20:20.412307image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:20.489409image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
4459
31.2%
3442
30.1%
1289
19.7%
2280
19.0%

Most occurring characters

ValueCountFrequency (%)
4459
31.2%
3442
30.1%
1289
19.7%
2280
19.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
4459
31.2%
3442
30.1%
1289
19.7%
2280
19.0%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
4459
31.2%
3442
30.1%
1289
19.7%
2280
19.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
4459
31.2%
3442
30.1%
1289
19.7%
2280
19.0%

MaritalStatus
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
Married
673 
Single
470 
Divorced
327 

Length

Max length8
Median length7
Mean length6.902721088
Min length6

Characters and Unicode

Total characters10147
Distinct characters14
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSingle
2nd rowMarried
3rd rowSingle
4th rowMarried
5th rowMarried
ValueCountFrequency (%)
Married673
45.8%
Single470
32.0%
Divorced327
22.2%
2021-03-17T19:20:20.714975image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:20.807549image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
married673
45.8%
single470
32.0%
divorced327
22.2%

Most occurring characters

ValueCountFrequency (%)
r1673
16.5%
i1470
14.5%
e1470
14.5%
d1000
9.9%
M673
6.6%
a673
6.6%
S470
 
4.6%
n470
 
4.6%
g470
 
4.6%
l470
 
4.6%
Other values (4)1308
12.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter8677
85.5%
Uppercase Letter1470
 
14.5%

Most frequent character per category

ValueCountFrequency (%)
r1673
19.3%
i1470
16.9%
e1470
16.9%
d1000
11.5%
a673
7.8%
n470
 
5.4%
g470
 
5.4%
l470
 
5.4%
v327
 
3.8%
o327
 
3.8%
ValueCountFrequency (%)
M673
45.8%
S470
32.0%
D327
22.2%

Most occurring scripts

ValueCountFrequency (%)
Latin10147
100.0%

Most frequent character per script

ValueCountFrequency (%)
r1673
16.5%
i1470
14.5%
e1470
14.5%
d1000
9.9%
M673
6.6%
a673
6.6%
S470
 
4.6%
n470
 
4.6%
g470
 
4.6%
l470
 
4.6%
Other values (4)1308
12.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII10147
100.0%

Most frequent character per block

ValueCountFrequency (%)
r1673
16.5%
i1470
14.5%
e1470
14.5%
d1000
9.9%
M673
6.6%
a673
6.6%
S470
 
4.6%
n470
 
4.6%
g470
 
4.6%
l470
 
4.6%
Other values (4)1308
12.9%

MonthlyIncome
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1349
Distinct (%)91.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6502.931293
Minimum1009
Maximum19999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:20.912099image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum1009
5-th percentile2097.9
Q12911
median4919
Q38379
95-th percentile17821.35
Maximum19999
Range18990
Interquartile range (IQR)5468

Descriptive statistics

Standard deviation4707.956783
Coefficient of variation (CV)0.7239745541
Kurtosis1.005232691
Mean6502.931293
Median Absolute Deviation (MAD)2199
Skewness1.369816681
Sum9559309
Variance22164857.07
MonotocityNot monotonic
2021-03-17T19:20:21.060063image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
23424
 
0.3%
55623
 
0.2%
27413
 
0.2%
24513
 
0.2%
26103
 
0.2%
23803
 
0.2%
61423
 
0.2%
63473
 
0.2%
25593
 
0.2%
24043
 
0.2%
Other values (1339)1439
97.9%
ValueCountFrequency (%)
10091
0.1%
10511
0.1%
10521
0.1%
10811
0.1%
10911
0.1%
ValueCountFrequency (%)
199991
0.1%
199731
0.1%
199431
0.1%
199261
0.1%
198591
0.1%

MonthlyRate
Real number (ℝ≥0)

Distinct1427
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14313.1034
Minimum2094
Maximum26999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:21.218450image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum2094
5-th percentile3384.55
Q18047
median14235.5
Q320461.5
95-th percentile25431.9
Maximum26999
Range24905
Interquartile range (IQR)12414.5

Descriptive statistics

Standard deviation7117.786044
Coefficient of variation (CV)0.4972915967
Kurtosis-1.2149561
Mean14313.1034
Median Absolute Deviation (MAD)6206.5
Skewness0.01857780789
Sum21040262
Variance50662878.17
MonotocityNot monotonic
2021-03-17T19:20:21.376732image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
42233
 
0.2%
91503
 
0.2%
66702
 
0.1%
73242
 
0.1%
46582
 
0.1%
215342
 
0.1%
161542
 
0.1%
130082
 
0.1%
123552
 
0.1%
60692
 
0.1%
Other values (1417)1448
98.5%
ValueCountFrequency (%)
20941
0.1%
20971
0.1%
21041
0.1%
21121
0.1%
21221
0.1%
ValueCountFrequency (%)
269991
0.1%
269971
0.1%
269681
0.1%
269591
0.1%
269561
0.1%

NumCompaniesWorked
Real number (ℝ≥0)

ZEROS

Distinct10
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.693197279
Minimum0
Maximum9
Zeros197
Zeros (%)13.4%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:21.507613image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile8
Maximum9
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.498009006
Coefficient of variation (CV)0.9275254455
Kurtosis0.01021381669
Mean2.693197279
Median Absolute Deviation (MAD)1
Skewness1.026471112
Sum3959
Variance6.240048994
MonotocityNot monotonic
2021-03-17T19:20:21.604838image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1521
35.4%
0197
 
13.4%
3159
 
10.8%
2146
 
9.9%
4139
 
9.5%
774
 
5.0%
670
 
4.8%
563
 
4.3%
952
 
3.5%
849
 
3.3%
ValueCountFrequency (%)
0197
 
13.4%
1521
35.4%
2146
 
9.9%
3159
 
10.8%
4139
 
9.5%
ValueCountFrequency (%)
952
3.5%
849
3.3%
774
5.0%
670
4.8%
563
4.3%

Over18
Boolean

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
True
1470 
ValueCountFrequency (%)
True1470
100.0%
2021-03-17T19:20:21.678557image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

OverTime
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
False
1054 
True
416 
ValueCountFrequency (%)
False1054
71.7%
True416
 
28.3%
2021-03-17T19:20:21.722909image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

PercentSalaryHike
Real number (ℝ≥0)

Distinct15
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.20952381
Minimum11
Maximum25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:21.798464image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile11
Q112
median14
Q318
95-th percentile22
Maximum25
Range14
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.659937717
Coefficient of variation (CV)0.2406346025
Kurtosis-0.3005982221
Mean15.20952381
Median Absolute Deviation (MAD)2
Skewness0.8211279756
Sum22358
Variance13.39514409
MonotocityNot monotonic
2021-03-17T19:20:21.908639image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
11210
14.3%
13209
14.2%
14201
13.7%
12198
13.5%
15101
6.9%
1889
6.1%
1782
 
5.6%
1678
 
5.3%
1976
 
5.2%
2256
 
3.8%
Other values (5)170
11.6%
ValueCountFrequency (%)
11210
14.3%
12198
13.5%
13209
14.2%
14201
13.7%
15101
6.9%
ValueCountFrequency (%)
2518
 
1.2%
2421
 
1.4%
2328
1.9%
2256
3.8%
2148
3.3%

PerformanceRating
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
3
1244 
4
226 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row4
3rd row3
4th row3
5th row3
ValueCountFrequency (%)
31244
84.6%
4226
 
15.4%
2021-03-17T19:20:22.165777image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:22.240147image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
31244
84.6%
4226
 
15.4%

Most occurring characters

ValueCountFrequency (%)
31244
84.6%
4226
 
15.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
31244
84.6%
4226
 
15.4%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
31244
84.6%
4226
 
15.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
31244
84.6%
4226
 
15.4%

RelationshipSatisfaction
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
3
459 
4
432 
2
303 
1
276 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row4
3rd row2
4th row3
5th row4
ValueCountFrequency (%)
3459
31.2%
4432
29.4%
2303
20.6%
1276
18.8%
2021-03-17T19:20:22.457823image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:22.537236image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
3459
31.2%
4432
29.4%
2303
20.6%
1276
18.8%

Most occurring characters

ValueCountFrequency (%)
3459
31.2%
4432
29.4%
2303
20.6%
1276
18.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
3459
31.2%
4432
29.4%
2303
20.6%
1276
18.8%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
3459
31.2%
4432
29.4%
2303
20.6%
1276
18.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
3459
31.2%
4432
29.4%
2303
20.6%
1276
18.8%

StandardHours
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
80
1470 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters2940
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row80
2nd row80
3rd row80
4th row80
5th row80
ValueCountFrequency (%)
801470
100.0%
2021-03-17T19:20:22.744853image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:22.813262image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
801470
100.0%

Most occurring characters

ValueCountFrequency (%)
81470
50.0%
01470
50.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number2940
100.0%

Most frequent character per category

ValueCountFrequency (%)
81470
50.0%
01470
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common2940
100.0%

Most frequent character per script

ValueCountFrequency (%)
81470
50.0%
01470
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII2940
100.0%

Most frequent character per block

ValueCountFrequency (%)
81470
50.0%
01470
50.0%

StockOptionLevel
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
0
631 
1
596 
2
158 
3
85 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row1
ValueCountFrequency (%)
0631
42.9%
1596
40.5%
2158
 
10.7%
385
 
5.8%
2021-03-17T19:20:22.993587image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:23.075373image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
0631
42.9%
1596
40.5%
2158
 
10.7%
385
 
5.8%

Most occurring characters

ValueCountFrequency (%)
0631
42.9%
1596
40.5%
2158
 
10.7%
385
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
0631
42.9%
1596
40.5%
2158
 
10.7%
385
 
5.8%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
0631
42.9%
1596
40.5%
2158
 
10.7%
385
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
0631
42.9%
1596
40.5%
2158
 
10.7%
385
 
5.8%

TotalWorkingYears
Real number (ℝ≥0)

Distinct40
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.27959184
Minimum0
Maximum40
Zeros11
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:23.188662image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q16
median10
Q315
95-th percentile28
Maximum40
Range40
Interquartile range (IQR)9

Descriptive statistics

Standard deviation7.780781676
Coefficient of variation (CV)0.6898105701
Kurtosis0.9182695366
Mean11.27959184
Median Absolute Deviation (MAD)4
Skewness1.117171853
Sum16581
Variance60.54056348
MonotocityNot monotonic
2021-03-17T19:20:23.323385image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
10202
 
13.7%
6125
 
8.5%
8103
 
7.0%
996
 
6.5%
588
 
6.0%
181
 
5.5%
781
 
5.5%
463
 
4.3%
1248
 
3.3%
342
 
2.9%
Other values (30)541
36.8%
ValueCountFrequency (%)
011
 
0.7%
181
5.5%
231
 
2.1%
342
2.9%
463
4.3%
ValueCountFrequency (%)
402
 
0.1%
381
 
0.1%
374
0.3%
366
0.4%
353
0.2%

TrainingTimesLastYear
Real number (ℝ≥0)

ZEROS

Distinct7
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.799319728
Minimum0
Maximum6
Zeros54
Zeros (%)3.7%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:23.454982image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q33
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.289270621
Coefficient of variation (CV)0.4605656896
Kurtosis0.494992986
Mean2.799319728
Median Absolute Deviation (MAD)1
Skewness0.5531241711
Sum4115
Variance1.662218734
MonotocityNot monotonic
2021-03-17T19:20:23.561259image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
2547
37.2%
3491
33.4%
4123
 
8.4%
5119
 
8.1%
171
 
4.8%
665
 
4.4%
054
 
3.7%
ValueCountFrequency (%)
054
 
3.7%
171
 
4.8%
2547
37.2%
3491
33.4%
4123
 
8.4%
ValueCountFrequency (%)
665
 
4.4%
5119
 
8.1%
4123
 
8.4%
3491
33.4%
2547
37.2%

WorkLifeBalance
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.6 KiB
3
893 
2
344 
4
153 
1
 
80

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters1470
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row3
4th row3
5th row3
ValueCountFrequency (%)
3893
60.7%
2344
 
23.4%
4153
 
10.4%
180
 
5.4%
2021-03-17T19:20:23.808819image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category
2021-03-17T19:20:23.890117image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
3893
60.7%
2344
 
23.4%
4153
 
10.4%
180
 
5.4%

Most occurring characters

ValueCountFrequency (%)
3893
60.7%
2344
 
23.4%
4153
 
10.4%
180
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1470
100.0%

Most frequent character per category

ValueCountFrequency (%)
3893
60.7%
2344
 
23.4%
4153
 
10.4%
180
 
5.4%

Most occurring scripts

ValueCountFrequency (%)
Common1470
100.0%

Most frequent character per script

ValueCountFrequency (%)
3893
60.7%
2344
 
23.4%
4153
 
10.4%
180
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1470
100.0%

Most frequent character per block

ValueCountFrequency (%)
3893
60.7%
2344
 
23.4%
4153
 
10.4%
180
 
5.4%

YearsAtCompany
Real number (ℝ≥0)

ZEROS

Distinct37
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.008163265
Minimum0
Maximum40
Zeros44
Zeros (%)3.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:23.983993image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median5
Q39
95-th percentile20
Maximum40
Range40
Interquartile range (IQR)6

Descriptive statistics

Standard deviation6.126525152
Coefficient of variation (CV)0.8741984056
Kurtosis3.935508756
Mean7.008163265
Median Absolute Deviation (MAD)3
Skewness1.764529454
Sum10302
Variance37.53431044
MonotocityNot monotonic
2021-03-17T19:20:24.124724image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
5196
13.3%
1171
11.6%
3128
8.7%
2127
8.6%
10120
8.2%
4110
 
7.5%
790
 
6.1%
982
 
5.6%
880
 
5.4%
676
 
5.2%
Other values (27)290
19.7%
ValueCountFrequency (%)
044
 
3.0%
1171
11.6%
2127
8.6%
3128
8.7%
4110
7.5%
ValueCountFrequency (%)
401
 
0.1%
371
 
0.1%
362
 
0.1%
341
 
0.1%
335
0.3%

YearsInCurrentRole
Real number (ℝ≥0)

ZEROS

Distinct19
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.229251701
Minimum0
Maximum18
Zeros244
Zeros (%)16.6%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:24.260263image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q37
95-th percentile11
Maximum18
Range18
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.623137035
Coefficient of variation (CV)0.856685128
Kurtosis0.4774207735
Mean4.229251701
Median Absolute Deviation (MAD)3
Skewness0.9173631563
Sum6217
Variance13.12712197
MonotocityNot monotonic
2021-03-17T19:20:24.374129image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
2372
25.3%
0244
16.6%
7222
15.1%
3135
 
9.2%
4104
 
7.1%
889
 
6.1%
967
 
4.6%
157
 
3.9%
637
 
2.5%
536
 
2.4%
Other values (9)107
 
7.3%
ValueCountFrequency (%)
0244
16.6%
157
 
3.9%
2372
25.3%
3135
 
9.2%
4104
 
7.1%
ValueCountFrequency (%)
182
 
0.1%
174
 
0.3%
167
0.5%
158
0.5%
1411
0.7%

YearsSinceLastPromotion
Real number (ℝ≥0)

ZEROS

Distinct16
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.187755102
Minimum0
Maximum15
Zeros581
Zeros (%)39.5%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:24.493751image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q33
95-th percentile9
Maximum15
Range15
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.222430279
Coefficient of variation (CV)1.472939213
Kurtosis3.612673115
Mean2.187755102
Median Absolute Deviation (MAD)1
Skewness1.984289983
Sum3216
Variance10.3840569
MonotocityNot monotonic
2021-03-17T19:20:24.612391image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
0581
39.5%
1357
24.3%
2159
 
10.8%
776
 
5.2%
461
 
4.1%
352
 
3.5%
545
 
3.1%
632
 
2.2%
1124
 
1.6%
818
 
1.2%
Other values (6)65
 
4.4%
ValueCountFrequency (%)
0581
39.5%
1357
24.3%
2159
 
10.8%
352
 
3.5%
461
 
4.1%
ValueCountFrequency (%)
1513
0.9%
149
 
0.6%
1310
0.7%
1210
0.7%
1124
1.6%

YearsWithCurrManager
Real number (ℝ≥0)

ZEROS

Distinct18
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.123129252
Minimum0
Maximum17
Zeros263
Zeros (%)17.9%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2021-03-17T19:20:24.735927image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q37
95-th percentile10
Maximum17
Range17
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.568136121
Coefficient of variation (CV)0.8653951654
Kurtosis0.1710580839
Mean4.123129252
Median Absolute Deviation (MAD)3
Skewness0.833450992
Sum6061
Variance12.73159537
MonotocityNot monotonic
2021-03-17T19:20:24.856673image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
2344
23.4%
0263
17.9%
7216
14.7%
3142
9.7%
8107
 
7.3%
498
 
6.7%
176
 
5.2%
964
 
4.4%
531
 
2.1%
629
 
2.0%
Other values (8)100
 
6.8%
ValueCountFrequency (%)
0263
17.9%
176
 
5.2%
2344
23.4%
3142
9.7%
498
 
6.7%
ValueCountFrequency (%)
177
0.5%
162
 
0.1%
155
 
0.3%
145
 
0.3%
1314
1.0%

Interactions

2021-03-17T19:19:45.194442image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:45.337278image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:45.472605image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:45.589475image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:45.713546image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:45.840073image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:45.954069image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:46.069852image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:46.190400image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:46.317429image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:46.437689image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:46.564629image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:46.692710image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:46.815555image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:46.940102image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:47.061956image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:47.188971image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:47.313944image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:47.450373image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:47.585055image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:47.712121image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:47.952475image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:48.078507image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:48.202385image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:48.321476image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:48.447080image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:48.569715image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:48.694458image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:48.843483image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:48.966881image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:49.087571image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:49.200869image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:49.324312image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:49.457729image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:49.603101image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:49.716824image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:49.841370image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:49.982532image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:50.113964image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:50.259171image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:50.393505image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:50.525922image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:50.659226image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:50.781277image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:50.902171image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:51.023188image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:51.146107image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:51.274420image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:51.391191image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:51.506941image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:51.761756image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:51.898482image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:52.032095image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:52.166566image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:52.292465image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:52.417560image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:52.542377image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:52.664332image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:52.798748image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:52.946023image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:53.069685image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:53.217850image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:53.350499image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:53.481470image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:53.611344image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:53.745799image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:53.878434image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:54.013494image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:54.147963image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:54.280097image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:54.415003image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:54.540892image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:54.680945image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:54.822709image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:54.956992image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:55.090420image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:55.215810image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:55.341194image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:55.473112image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:55.612535image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:55.743402image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:55.877612image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:56.012833image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:56.146654image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:56.282439image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:56.551515image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:56.679767image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:56.797256image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:56.910297image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.030078image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.153922image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.266454image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.384213image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.511172image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.628493image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.752126image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.872093image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:57.993336image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:58.114203image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:58.223948image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:58.346606image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:58.464341image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:58.576226image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:58.696287image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:58.828456image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:58.940598image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:59.058168image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:59.180529image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:59.296240image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:59.417304image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:59.543887image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:59.663177image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:59.782988image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:19:59.900570image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:00.032199image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:00.154502image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:00.273339image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:00.400250image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:00.529006image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:00.648010image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:00.766385image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:00.896089image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:01.020451image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:01.151369image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:01.280939image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:01.409958image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:01.538994image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:01.661264image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:01.792886image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:02.118949image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:02.259648image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:02.400105image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:02.539417image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:02.665061image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:02.791687image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:02.923787image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:03.058514image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:03.195570image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:03.331215image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:03.466361image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:03.602974image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:03.720600image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:03.845174image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:03.967123image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:04.086884image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:04.214602image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:04.345022image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:04.467589image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:04.585571image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:04.712329image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:04.844089image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:04.988256image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:05.124687image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:05.258255image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:05.393301image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:05.526874image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:05.665079image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:05.801333image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:05.934877image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:06.075313image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:06.217606image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:06.352505image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:06.486480image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:06.626163image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:06.770321image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:06.909462image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:07.050498image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:07.202389image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:07.345681image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:07.472147image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:07.610659image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:07.744337image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:07.875806image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:08.012275image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:08.151051image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:08.281747image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:08.410995image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:08.547820image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:08.687941image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:08.844465image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:08.989564image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:09.348948image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:09.499330image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:09.625861image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:09.748256image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:09.875222image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:09.991458image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:10.118155image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:10.247711image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:10.381550image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:10.505552image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:10.639458image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:10.769844image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:10.897017image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:11.027669image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:11.156790image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:11.283029image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:11.409468image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:11.541145image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:11.665980image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:11.784267image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:11.912720image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:12.039112image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:12.160097image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:12.279243image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:12.409439image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:12.546104image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:12.669466image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:12.812144image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-03-17T19:20:12.948800image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Correlations

2021-03-17T19:20:25.449249image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2021-03-17T19:20:25.867153image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2021-03-17T19:20:26.259827image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2021-03-17T19:20:26.680234image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.
2021-03-17T19:20:27.077807image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

2021-03-17T19:20:13.278656image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
A simple visualization of nullity by column.
2021-03-17T19:20:14.615404image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

AgeAttritionBusinessTravelDailyRateDepartmentDistanceFromHomeEducationEducationFieldEmployeeCountEmployeeNumberEnvironmentSatisfactionGenderHourlyRateJobInvolvementJobLevelJobRoleJobSatisfactionMaritalStatusMonthlyIncomeMonthlyRateNumCompaniesWorkedOver18OverTimePercentSalaryHikePerformanceRatingRelationshipSatisfactionStandardHoursStockOptionLevelTotalWorkingYearsTrainingTimesLastYearWorkLifeBalanceYearsAtCompanyYearsInCurrentRoleYearsSinceLastPromotionYearsWithCurrManager
041YesTravel_Rarely1102Sales12Life Sciences112Female9432Sales Executive4Single5993194798YYes11318008016405
149NoTravel_Frequently279Research & Development81Life Sciences123Male6122Research Scientist2Married5130249071YNo2344801103310717
237YesTravel_Rarely1373Research & Development22Other144Male9221Laboratory Technician3Single209023966YYes15328007330000
333NoTravel_Frequently1392Research & Development34Life Sciences154Female5631Research Scientist3Married2909231591YYes11338008338730
427NoTravel_Rarely591Research & Development21Medical171Male4031Laboratory Technician2Married3468166329YNo12348016332222
532NoTravel_Frequently1005Research & Development22Life Sciences184Male7931Laboratory Technician4Single3068118640YNo13338008227736
659NoTravel_Rarely1324Research & Development33Medical1103Female8141Laboratory Technician1Married267099644YYes204180312321000
730NoTravel_Rarely1358Research & Development241Life Sciences1114Male6731Laboratory Technician3Divorced2693133351YNo22428011231000
838NoTravel_Frequently216Research & Development233Life Sciences1124Male4423Manufacturing Director3Single952687870YNo214280010239718
936NoTravel_Rarely1299Research & Development273Medical1133Male9432Healthcare Representative3Married5237165776YNo133280217327777

Last rows

AgeAttritionBusinessTravelDailyRateDepartmentDistanceFromHomeEducationEducationFieldEmployeeCountEmployeeNumberEnvironmentSatisfactionGenderHourlyRateJobInvolvementJobLevelJobRoleJobSatisfactionMaritalStatusMonthlyIncomeMonthlyRateNumCompaniesWorkedOver18OverTimePercentSalaryHikePerformanceRatingRelationshipSatisfactionStandardHoursStockOptionLevelTotalWorkingYearsTrainingTimesLastYearWorkLifeBalanceYearsAtCompanyYearsInCurrentRoleYearsSinceLastPromotionYearsWithCurrManager
146029NoTravel_Rarely468Research & Development284Medical120544Female7321Research Scientist1Single378584891YNo14328005315404
146150YesTravel_Rarely410Sales283Marketing120554Male3923Sales Executive1Divorced10854165864YYes133280120333220
146239NoTravel_Rarely722Sales241Marketing120562Female6024Sales Executive4Married1203188280YNo1131801212220996
146331NoNon-Travel325Research & Development53Medical120572Male7432Manufacturing Director1Single993637870YNo193280010239417
146426NoTravel_Rarely1167Sales53Other120604Female3021Sales Representative3Single2966213780YNo18348005234200
146536NoTravel_Frequently884Research & Development232Medical120613Male4142Laboratory Technician4Married2571122904YNo173380117335203
146639NoTravel_Rarely613Research & Development61Medical120624Male4223Healthcare Representative1Married9991214574YNo15318019537717
146727NoTravel_Rarely155Research & Development43Life Sciences120642Male8742Manufacturing Director2Married614251741YYes20428016036203
146849NoTravel_Frequently1023Sales23Medical120654Male6322Sales Executive2Married5390132432YNo143480017329608
146934NoTravel_Rarely628Research & Development83Medical120682Male8242Laboratory Technician3Married4404102282YNo12318006344312